Abstract

Measurements are never absolutely accurate; as a result, after each measurement, we do not get the
exact value of the measured quantity: at best, we get an interval of its possible values.

For dynamically changing quantities $x$, the additional problem is that we cannot measure them
continuously; we can only measure them at certain discrete moments of time $t_1, t_2, \ldots$ If we know that
the value $x(t_j)$ at the moment $t_j$ of the last measurement was in the interval $[x^-(t_j), x^+(t_j)]$, and if we
know the upper bound $D$ on the rate with which $x$ changes, then, for any given moment of time $t$, we
can conclude that $x(t)$ belongs to the interval $[x^-(t_j) - D \cdot (t - t_j), x^+(t_j) + D \cdot (t - t_j)]$. This interval
changes linearly with time, and is, therefore, called a linear interval function.

When we process these intervals, we get an expression that is quadratic and higher order w.r.t. time 
t. Such “quadratic” intervals are difficult to process and therefore, it is necessary to approximate them 
by linear ones. 

In this paper, we describe an algorithm that gives the optimal approximation of quadratic interval
functions by linear ones. 


1 Introduction: intervals, linear and quadratic interval functions, and why it is necessary to approximate

The need for indirect measurements. In many real-life problems, we are interested in the values of a 
quantity y that is difficult or impossible to measure directly. For example: 

• in astrophysics, we cannot directly measure the mass or the temperature of the star; 

• in geophysics, we cannot directly measure the amount of oil in a given area, etc.

In all these situations, we measure some related quantities $x_1, \ldots, x_n$ that can be measured directly, and
then use the known relationship $y = f(x_1, \ldots, x_n)$ between $x_i$ and $y$ and the results $\tilde x_i$ of measuring $x_i$ to
estimate $y$ as $\tilde y = f(\tilde x_1, \ldots, \tilde x_n)$. For example:

• to measure the temperature of the star, we measure its brightness $x_1, \ldots, x_n$ on different wavelengths,
and then try to fit the resulting spectrum into a black-body radiation spectrum corresponding to the
temperature $y$;

• to estimate the amount of oil $y$ in a given area, we measure the geophysical characteristics of different
parts of it, and use known equations to estimate $y$.

In contrast to a direct measurement of $x_i$, this two-stage process of direct measurements followed by data
processing is called indirect measurement. 

In space sciences, and especially in their applications to environmental and earth sciences, indirect
measurements are the main method of measurement.

The main problem: estimating accuracy of the result of data processing. After the data processing
is over, the main problem is: how accurate is the result $\tilde y$ of data processing?

For example, suppose that we estimate the amount of oil in a given area, and we got $\tilde y = 100$ million tons.
Our further actions will depend on the accuracy of this result:

• if the accuracy is high, e.g., if the actual amount of oil is 100 ± 1, then we will probably start drilling;

• however, if the accuracy is low, and the actual value is, say, 100 ± 100, then it may make more sense
to undertake further, more accurate, measurements.

The inaccuracy in y comes from two sources: 

• First, the model $y = f(x_1, \ldots, x_n)$ that is used in data processing is often inaccurate. This inaccuracy
is the easiest to take into consideration: if we know the upper bound $\Delta_m$ on the error $y - f(x_1, \ldots, x_n)$
of this model, and if we know the values of $x_i$, then we can conclude that the actual value of $y$ belongs
to the interval $[\tilde y - \Delta_m, \tilde y + \Delta_m]$, where $\tilde y = f(x_1, \ldots, x_n)$.

• Second, the measurement results $\tilde x_i$ are also inaccurate and therefore, may differ from the actual values
of the measured quantities $x_i$. This error is much more difficult to estimate, and this is what we will
be doing in the present paper.

Intervals. Measurements are never 100% precise. Thus, if, as the result of measuring a certain quantity
$x_i$, we get a measurement result $\tilde x_i$, it does not necessarily mean that the actual value $x_i$ of this quantity is
exactly equal to $\tilde x_i$.

The manufacturer of the measuring instrument usually supplies it with the upper bound $\Delta_i$ for the
measurement error $\Delta x_i = \tilde x_i - x_i$; in other words, the manufacturer guarantees that $|\Delta x_i| \le \Delta_i$.

In this case, if we have measured a quantity $x_i$ and the measurement result is $\tilde x_i$, then the only information
that we have about the actual value is that this actual value cannot differ from $\tilde x_i$ by more than $\Delta_i$, i.e.,
that this actual value must be within the interval $[\tilde x_i - \Delta_i, \tilde x_i + \Delta_i]$.

If we know these intervals $[x_i^-, x_i^+]$, then the set $Y$ of possible values of $y = f(x_1, \ldots, x_n)$ can be described as
follows:

$$Y = \{ f(x_1, \ldots, x_n) \mid x_1 \in [x_1^-, x_1^+], \ldots, x_n \in [x_n^-, x_n^+] \}. \quad (1)$$

Comments. 

• Sometimes, in addition to the upper bound for the error, we know the probabilities of different error
values. However, in many real-life cases, we do not know these probabilities, and the upper bound $\Delta_i$
is the only information about the measurement error $\Delta x_i$ that we have.

• Computations that take this interval uncertainty into consideration are called interval computations
(see, e.g., [1, 2]).
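
As a minimal sketch of formula (1), assume (for illustration only; the paper's $f$ is general) that $f$ is linear, $f(x_1, \ldots, x_n) = c_0 + \sum c_i x_i$. In that special case the set $Y$ is an interval, and each endpoint is obtained by picking, for every $i$, the endpoint of $[x_i^-, x_i^+]$ dictated by the sign of $c_i$. The function name `linear_range` and the sample numbers are hypothetical.

```python
from typing import Sequence, Tuple


def linear_range(c0: float,
                 coeffs: Sequence[float],
                 boxes: Sequence[Tuple[float, float]]) -> Tuple[float, float]:
    """Range of f(x) = c0 + sum(c_i * x_i) when each x_i lies in [lo_i, hi_i]."""
    lo = hi = c0
    for c, (x_lo, x_hi) in zip(coeffs, boxes):
        if c >= 0:
            # increasing in x_i: smallest value at x_lo, largest at x_hi
            lo += c * x_lo
            hi += c * x_hi
        else:
            # decreasing in x_i: the roles of the endpoints swap
            lo += c * x_hi
            hi += c * x_lo
    return lo, hi


# Hypothetical measurements 2.0 +/- 0.1 and 5.0 +/- 0.2, with f = 3*x1 - x2.
y_lo, y_hi = linear_range(0.0, [3.0, -1.0], [(1.9, 2.1), (4.8, 5.2)])
print(y_lo, y_hi)
```

For a non-linear $f$ the endpoints of $Y$ are in general not attained at box corners chosen this simply, which is why interval computations is a research area of its own.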

Linear interval functions. In some cases, the values $x_i$ do not change with time. In these cases, the only
inaccuracy is caused by the inaccuracy of the direct measurement.

However, in many real-life situations, the measured values change with time. For such dynamically
changing quantities $x_i$, there is an additional source of uncertainty: namely, we cannot measure a quantity
continuously; we can only measure it at certain discrete moments of time $t_1, t_2, \ldots$ Hence:

• If we are interested in the value $x_i(t_j)$ at one of the moments of time $t_j$ at which we have actually
measured $x_i$, then the inaccuracy of the direct measurement is the only source of the measurement
error, and we get the interval $[x_i^-(t_j), x_i^+(t_j)]$ of possible values of $x_i(t_j)$.

• However, if we are interested in the value $x_i(t)$ of the quantity at a moment of time $t$ at which no
direct measurement of $x_i$ was performed, then we must use one of the measured values, and get an
additional error component caused by the change of $x_i$ between the moment $t_j$ of the last measurement
and the desired moment $t$. There are two methods to take this additional error into consideration:

- If we know the upper bound $D_i$ on the rate with which $x_i$ changes, then, for any given moment
of time $t$, we can conclude that $x_i(t_j) - D_i \cdot (t - t_j) \le x_i(t) \le x_i(t_j) + D_i \cdot (t - t_j)$. Hence, from
the fact that $x_i^-(t_j) \le x_i(t_j) \le x_i^+(t_j)$, we conclude that the value $x_i(t)$ belongs to the interval
$[x_i^-(t), x_i^+(t)]$, where $x_i^-(t) = x_i^-(t_j) - D_i \cdot (t - t_j)$ and $x_i^+(t) = x_i^+(t_j) + D_i \cdot (t - t_j)$.

- In some cases, simultaneously with measuring the values $x_i(t_j)$, we can directly measure the rate
with which these values change; e.g.,

* simultaneously with measuring coordinates $x_i(t)$ of a robot, we can measure its velocity $\dot x_i(t)$;

* simultaneously with measuring the velocity $x_i(t)$, we can measure the acceleration $\dot x_i(t)$, etc.

In these cases, if we know the interval $[\dot x_i^-(t_j), \dot x_i^+(t_j)]$ of possible values of the rate, and we know
the upper bound $S_i$ on the second time derivative, we can conclude that:

* for all $t \in [t_j, t_{j+1}]$, the values $\dot x_i(t)$ belong to the interval $[v^-, v^+]$, where
$v^- = \dot x_i^-(t_j) - S_i \cdot (t_{j+1} - t_j)$ and $v^+ = \dot x_i^+(t_j) + S_i \cdot (t_{j+1} - t_j)$, and therefore, that

* for each of these moments of time, the actual values of $x_i(t)$ belong to the interval
$[x_i^-(t), x_i^+(t)]$, where $x_i^-(t) = x_i^-(t_j) + v^- \cdot (t - t_j)$ and $x_i^+(t) = x_i^+(t_j) + v^+ \cdot (t - t_j)$.

In general, the only information that we have about the actual value of $x(t)$ is that $x(t) \in \mathbf{x}(t) =
[x^-(t), x^+(t)]$. In other words, instead of a single interval, we have an interval function that assigns an
interval $\mathbf{x}(t)$ to each moment of time $t$.

In our cases, the endpoints $x^-(t)$ and $x^+(t)$ of this interval are linear functions and therefore, it is natural
to call this interval function a linear interval function.
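
The first of the two methods can be sketched in a few lines. This is a minimal illustration that assumes a single quantity, a known bound $D$ on its rate of change, and $t \ge t_j$; the function name `linear_interval_from_rate` and the numbers are hypothetical.

```python
from typing import Callable, Tuple


def linear_interval_from_rate(x_lo: float, x_hi: float,
                              t_j: float, D: float
                              ) -> Callable[[float], Tuple[float, float]]:
    """Build t -> (x^-(t), x^+(t)) from the interval [x_lo, x_hi] measured at t_j
    and an upper bound D on the rate of change."""
    if D < 0:
        raise ValueError("the rate bound D must be non-negative")

    def x_of_t(t: float) -> Tuple[float, float]:
        if t < t_j:
            raise ValueError("t must not precede the measurement time t_j")
        dt = t - t_j
        # both endpoints are linear in t: the interval widens by 2*D per unit time
        return x_lo - D * dt, x_hi + D * dt

    return x_of_t


# Hypothetical reading 10.0 +/- 0.1 taken at t_j = 0, with rate bound D = 0.5.
x = linear_interval_from_rate(9.9, 10.1, t_j=0.0, D=0.5)
print(x(0.0))   # the measured interval itself
print(x(2.0))   # a wider interval two time units later
```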

For linear data processing, linear interval functions remain linear. If the algorithm $f(x_1, \ldots, x_n)$
is a linear function, then, as a result of applying this algorithm to linear interval functions, we get an
estimate (1) that is either a linear or a piece-wise linear function of time $t$.

In [3, 4, 5], such linear interval functions were effectively used in electrical and electronic engineering 
analysis of circuits with linear elements. 

For non-linear data processing, we need an approximation. In most real-life situations, however, 
data processing algorithms are non-linear. As a result, if we start with the intervals for $x_i(t)$ that are linear
in time $t$, we end up with intervals for $y(t)$ whose dependence on $t$ is much more complicated. For example:

• if we add or subtract two intervals that are linear in time, we still get the result that is linear in time; 

but 

• if we multiply two intervals that are linear in time, then their endpoints also get multiplied, and, as a
result, we get an interval function $[y^-(t), y^+(t)]$ in which the endpoints are quadratic functions of time
(i.e., we get quadratic interval functions).

If we multiply more, we get cubic, quartic, etc. functions. In principle, there is nothing wrong with this 
complexity, except for the fact that while a linear interval function requires only 4 numbers to store (2 
coefficients of the lower endpoint and 2 coefficients of the upper endpoint), quadratic functions require 6 
coefficients, cubic functions require 8, etc. The more complicated the function becomes, the more memory 
we need to store these coefficients, and the longer it takes to process these functions. 
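
The growth from 4 to 6 stored numbers can be seen directly. The sketch below multiplies two linear interval functions under the assumption that both stay non-negative on the time range of interest, so that the lower endpoints multiply with each other and the upper endpoints multiply with each other; for intervals containing negative values the endpoint pairing is different. The names `multiply_endpoints` and `multiply_nonnegative` are hypothetical.

```python
from typing import Tuple

Linear = Tuple[float, float]             # (c0, c1) meaning c0 + c1 * t
Quadratic = Tuple[float, float, float]   # (c0, c1, c2) meaning c0 + c1 * t + c2 * t**2


def multiply_endpoints(a: Linear, b: Linear) -> Quadratic:
    """Coefficients of (a0 + a1*t) * (b0 + b1*t)."""
    a0, a1 = a
    b0, b1 = b
    return a0 * b0, a0 * b1 + a1 * b0, a1 * b1


def multiply_nonnegative(x_lo: Linear, x_hi: Linear,
                         u_lo: Linear, u_hi: Linear
                         ) -> Tuple[Quadratic, Quadratic]:
    """Product of two linear interval functions, ASSUMING both are non-negative
    for all t of interest (so lower*lower and upper*upper are the endpoints)."""
    return multiply_endpoints(x_lo, u_lo), multiply_endpoints(x_hi, u_hi)


# Hypothetical inputs, non-negative for t in [0, 2]:
# x(t) = [2 - t, 3 + t],  u(t) = [4 - t, 5 + t]
y_lo, y_hi = multiply_nonnegative((2.0, -1.0), (3.0, 1.0), (4.0, -1.0), (5.0, 1.0))
print(y_lo)   # three coefficients of the lower endpoint
print(y_hi)   # three coefficients of the upper endpoint
```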

Thus, since we are often limited both in processing time and in memory (especially if the processing is 
done in an on-board computer), we must approximate the given complicated interval function by a simpler 
one. 

In other words, if we have an interval function $\mathbf{y}(t) = [y^-(t), y^+(t)]$ that is known to contain the actual
value $y(t)$, we want to be able to find a simpler interval function $\mathbf{z}(t) = [z^-(t), z^+(t)]$ that, for each $t$, contains
the entire interval $\mathbf{y}(t)$ and is, thus, guaranteed to contain the actual value $y(t)$.

Of course, this approximation comes at a trade-off: we simplify the expression, but we make the interval
wider (and therefore, lose some information). Therefore, the narrower the approximating interval function, 
the better. 

What we are planning to do. The simplest approximation problem of this type is the problem of
approximating a quadratic interval function by linear ones. In this paper, we will present an optimal solution
to this problem.


2 When is the approximation optimal? Mathematical formulation of the problem

Definition 1. Let an interval $[t^-, t^+]$ be fixed.

• By an interval function, we mean a mapping $\mathbf{x}$ that puts into correspondence to each number $t \in [t^-, t^+]$
an interval $\mathbf{x}(t) = [x^-(t), x^+(t)]$.

• If both functions $x^-(t)$ and $x^+(t)$ are linear functions of $t$ (i.e., if $x^\pm(t) = x_0^\pm + x_1^\pm \cdot t$), we say that the
interval function is linear.

• If both functions $x^-(t)$ and $x^+(t)$ are quadratic functions of $t$ (i.e., if $x^\pm(t) = x_0^\pm + x_1^\pm \cdot t + x_2^\pm \cdot t^2$), we
say that the interval function is quadratic.

• We say that a linear interval function $\mathbf{z}(t) = [z^-(t), z^+(t)]$ approximates a quadratic interval function
$\mathbf{y}(t) = [y^-(t), y^+(t)]$ if $\mathbf{y}(t) \subseteq \mathbf{z}(t)$ for all $t$.

Comment. The narrower the intervals, the better. So, our goal is to minimize the worst-case width of the
approximating interval, i.e., the value $W(\mathbf{z}) = \max_t (z^+(t) - z^-(t))$.

We will see that in some cases, for a given quadratic interval function $\mathbf{y}(t)$, there are several approximating
linear interval functions $\mathbf{z}$ with the same value of $W(\mathbf{z})$. If two different approximating functions have the
same worst-case widths, then it is reasonable to choose the one for which the best-case width $w(\mathbf{z}) =
\min_t (z^+(t) - z^-(t))$ is the smallest. Thus, we arrive at the following definition:

Definition 2.

• For every interval function $\mathbf{z}(t)$:

- by its worst-case width, we mean a value

$$W(\mathbf{z}) = \max_{t \in [t^-, t^+]} (z^+(t) - z^-(t));$$

- by its best-case width, we mean a value

$$w(\mathbf{z}) = \min_{t \in [t^-, t^+]} (z^+(t) - z^-(t)).$$

• Let a quadratic interval function $\mathbf{y}$ be fixed. We say that a linear interval function $\mathbf{z}$ is an optimal approximation
of $\mathbf{y}$ if the following conditions are satisfied:

- first, $\mathbf{z}$ is an approximation of $\mathbf{y}$;

- second, among all linear approximations to $\mathbf{y}$, the function $\mathbf{z}$ has the smallest value of the
worst-case width $W(\mathbf{z})$;

- third, if there exist several linear approximations $\mathbf{u}$ to $\mathbf{y}$ with the same smallest value of the
worst-case width $W(\mathbf{u})$, then among them the function $\mathbf{z}$ has the smallest value of the best-case width $w(\mathbf{z})$.
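
For a linear interval function, the width $z^+(t) - z^-(t)$ is itself linear in $t$, so both its maximum and its minimum over $[t^-, t^+]$ are attained at the endpoints of that interval. A minimal sketch of Definition 2 under that observation follows; the coefficient-pair representation and the name `widths` are assumptions made for illustration.

```python
from typing import Tuple

Linear = Tuple[float, float]   # (z0, z1) meaning z0 + z1 * t


def widths(z_lo: Linear, z_hi: Linear,
           t_minus: float, t_plus: float) -> Tuple[float, float]:
    """Return (W, w): worst-case and best-case width of the linear interval
    function [z_lo(t), z_hi(t)] on [t_minus, t_plus]."""
    def width_at(t: float) -> float:
        return (z_hi[0] + z_hi[1] * t) - (z_lo[0] + z_lo[1] * t)

    w_left, w_right = width_at(t_minus), width_at(t_plus)
    # a linear function on a closed interval attains its extrema at the ends
    return max(w_left, w_right), min(w_left, w_right)


# Hypothetical example: z^-(t) = t, z^+(t) = 2 + 2t on [0, 3].
W, w = widths((0.0, 1.0), (2.0, 2.0), 0.0, 3.0)
print(W, w)
```

Comparing two candidate approximations then amounts to comparing the pairs `(W, w)` lexicographically, which mirrors the second and third conditions of the definition.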

What we are planning to do. In this paper, we will describe the optimal linear approximation to an 
arbitrary quadratic interval function. 

The resulting formulas will depend on the signs of the coefficients ($y_2^-$ and $y_2^+$) at the quadratic term $t^2$. In
each of the resulting four cases, we will describe the geometric idea that leads to the optimal approximation,
and give the explicit formulas for this approximation.

3 Case 1: $y_2^- < 0$, $y_2^+ > 0$

Derivation of the solution. In this case, both functions $y^-(t)$ and $y^+(t)$ are "pointing inward".

If $\mathbf{z}(t)$ is a linear interval approximation to the given quadratic interval function, then, from $\mathbf{y}(t) \subseteq \mathbf{z}(t)$, it
follows, in particular, that $z^-(t) \le y^-(t)$ for all $t \in [t^-, t^+]$; therefore, $z^-(t^-) \le y^-(t^-)$ and $z^-(t^+) \le y^-(t^+)$.

Since $y_2^- < 0$, the quadratic function $y^-(t)$ is concave and hence, from $z^-(t^-) \le y^-(t^-)$ and
$z^-(t^+) \le y^-(t^+)$, i.e., from the fact that the line $z^-(t)$ lies below the two endpoints of the graph of $y^-(t)$, it
automatically follows that the entire graph of $y^-(t)$ is above the line $z^-(t)$. So, it is sufficient to guarantee
that at the endpoints, the values $z^-(t^\pm)$ do not exceed the corresponding values of $y^-(t^\pm)$.

For fixed $z^+(t)$, the resulting intervals are the narrowest when the line $z^-(t)$ is at the highest possible
location. Thus, to minimize the widths of the intervals, we must move both points $z^-(t^\pm)$ up as much as
possible. The highest possible location for $z^-(t^-)$ is $y^-(t^-)$, and the highest possible location for $z^-(t^+)$ is
$y^-(t^+)$. Thus, $z^-(t)$ is a straight line going through the points $(t^\pm, y^-(t^\pm))$, i.e., a secant.

Similarly, $z^+(t)$ is a secant of $y^+(t)$.

Solution. In the optimal approximation, the function $z^-(t)$ is the secant of $y^-(t)$, i.e., a straight line whose
endpoints are the endpoints of the quadratic function $y^-(t)$ on this interval. Similarly, $z^+(t)$ is a secant of
$y^+(t)$, i.e.,

$$z^-(t) = y^-(t^-) + \frac{y^-(t^+) - y^-(t^-)}{t^+ - t^-} \cdot (t - t^-); \quad (2)$$

$$z^+(t) = y^+(t^-) + \frac{y^+(t^+) - y^+(t^-)}{t^+ - t^-} \cdot (t - t^-). \quad (3)$$
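
Formulas (2) and (3) translate directly into code. The sketch below represents a quadratic endpoint by its coefficient triple $(y_0, y_1, y_2)$ and returns the secant as an intercept and slope pair; this representation and the names `quad` and `secant` are assumptions made for illustration.

```python
from typing import Tuple

Linear = Tuple[float, float]             # (z0, z1) meaning z0 + z1 * t
Quadratic = Tuple[float, float, float]   # (y0, y1, y2) meaning y0 + y1*t + y2*t**2


def quad(c: Quadratic, t: float) -> float:
    """Evaluate y0 + y1*t + y2*t**2."""
    return c[0] + c[1] * t + c[2] * t * t


def secant(c: Quadratic, t_minus: float, t_plus: float) -> Linear:
    """Line through (t_minus, y(t_minus)) and (t_plus, y(t_plus)), as in (2)/(3)."""
    if t_plus <= t_minus:
        raise ValueError("need t_minus < t_plus")
    y_a, y_b = quad(c, t_minus), quad(c, t_plus)
    slope = (y_b - y_a) / (t_plus - t_minus)
    # rewrite y_a + slope * (t - t_minus) as intercept + slope * t
    return y_a - slope * t_minus, slope


# Hypothetical Case 1 input on [0, 2]: y^-(t) = -t^2 (concave), y^+(t) = 5 + t^2 (convex).
z_lo = secant((0.0, 0.0, -1.0), 0.0, 2.0)
z_hi = secant((5.0, 0.0, 1.0), 0.0, 2.0)
print(z_lo, z_hi)
```

In this example the secants are $z^-(t) = -2t$ and $z^+(t) = 5 + 2t$, and one can check that $-2t \le -t^2$ and $5 + t^2 \le 5 + 2t$ hold on $[0, 2]$.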


4 Case 2: $y_2^- < 0$, $y_2^+ < 0$

Derivation of the solution. The arguments given for Case 1 show that in this case, the function $z^-(t)$
is still the secant. With $z^-(t)$ fixed, it is sufficient to find the upper function $z^+(t)$ for which the resulting
approximation is optimal.

Let us first guarantee that the worst-case width is indeed the smallest possible. If this width is equal
to $W_0$, this means that $z^+(t) \le z^-(t) + W_0$, and therefore, that $y^+(t) \le z^+(t) \le z^-(t) + W_0$. Thus, to
guarantee that $W_0$ takes the smallest possible value, we must choose $W_0$ as the smallest possible value for
which $y^+(t) \le z^-(t) + W_0$ for all $t \in [t^-, t^+]$. In other words, as $W_0$, we take the maximum of the function
$y^+(t) - z^-(t)$ on the interval $[t^-, t^+]$.

The maximum of the concave quadratic function $y^+(t) - z^-(t) = (y_0^+ - z_0^-) + (y_1^+ - z_1^-) \cdot t + y_2^+ \cdot t^2$ is
attained at the point where its derivative is equal to 0, i.e., at the point

$$t_m = -(y_1^+ - z_1^-)/(2 y_2^+). \quad (4)$$

If this maximum is attained at the internal point $t_m$ of this interval, then at $t_m$, the line $z^-(t) + W_0$ is a
tangent to $y^+(t)$, and therefore, there is no way to find a lower line without increasing the worst-case width
$W(\mathbf{z})$. So, in this case, $z^+(t) = z^-(t) + W_0$.

If the maximum is attained at one of the endpoints, e.g., at $t^+$, then we can, keeping the straight line
$z^+(t)$ at the point $(t^+, y^+(t^+))$, lower its other end and still get the same worst-case width. The lowest value
of the best-case width is attained when we lower the other end to the lowest possible position in which it is
still above $y^+(t)$, i.e., to the position of a tangent to $y^+(t)$.

Solution. For the optimal approximation, the lower line $z^-(t)$ is a secant (2). To determine the upper line
$z^+(t)$, we apply the formula (4) to compute the value $t_m$. Then:

• If $t_m \in [t^-, t^+]$, we take $z^+(t) = z^-(t) + (y^+(t_m) - z^-(t_m))$.

• If $t_m > t^+$, then $z^+$ is the tangent to $y^+$ at $t^+$:

$$z^+(t) = y^+(t^+) + (y_1^+ + 2 y_2^+ \cdot t^+)(t - t^+). \quad (5)$$

• If $t_m < t^-$, then $z^+$ is the tangent to $y^+$ at $t^-$:

$$z^+(t) = y^+(t^-) + (y_1^+ + 2 y_2^+ \cdot t^-)(t - t^-). \quad (6)$$
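
The three branches of the Case 2 solution can be sketched as follows. The sketch takes the secant $z^-$ as an already computed input and returns the upper line as an intercept and slope pair; the coefficient-tuple representation and the name `case2_upper_line` are assumptions made for illustration.

```python
from typing import Tuple

Linear = Tuple[float, float]             # (z0, z1) meaning z0 + z1 * t
Quadratic = Tuple[float, float, float]   # (y0, y1, y2) meaning y0 + y1*t + y2*t**2


def case2_upper_line(y_hi: Quadratic, z_lo: Linear,
                     t_minus: float, t_plus: float) -> Linear:
    """Upper line z^+ for Case 2 (y2^+ < 0), given the secant z^- of y^-."""
    y0, y1, y2 = y_hi
    z0, z1 = z_lo
    if y2 >= 0:
        raise ValueError("Case 2 requires a concave upper endpoint (y2 < 0)")
    t_m = -(y1 - z1) / (2.0 * y2)                    # formula (4)
    if t_minus <= t_m <= t_plus:
        # interior maximum: shift z^- up by W0 = y^+(t_m) - z^-(t_m)
        w0 = (y0 + y1 * t_m + y2 * t_m ** 2) - (z0 + z1 * t_m)
        return z0 + w0, z1
    # maximum at an endpoint: tangent to y^+ there, formulas (5) and (6)
    t_c = t_plus if t_m > t_plus else t_minus
    slope = y1 + 2.0 * y2 * t_c
    # y^+(t_c) - slope * t_c simplifies to y0 - y2 * t_c**2
    return y0 - y2 * t_c ** 2, slope


# Hypothetical input on [0, 2]: y^-(t) = -t^2 has secant z^-(t) = -2t.
print(case2_upper_line((3.0, 0.0, -1.0), (0.0, -2.0), 0.0, 2.0))  # interior t_m
print(case2_upper_line((3.0, 4.0, -1.0), (0.0, -2.0), 0.0, 2.0))  # t_m beyond t^+
```

In the first call $t_m = 1$ lies inside $[0, 2]$, so the result is $z^-$ shifted up by $W_0 = 4$; in the second call $t_m = 3 > t^+$, so the result is the tangent to $y^+$ at $t^+ = 2$.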

5 Case 3: $y_2^- > 0$, $y_2^+ > 0$

This case is similar to case 2, so we can immediately give a solution: 

Solution. For the optimal approximation, the upper line $z^+(t)$ is a secant (3). To determine the lower line
$z^-(t)$, we compute the value $t_m = -(y_1^- - z_1^+)/(2 y_2^-)$. Then:

• If $t_m \in [t^-, t^+]$, we take $z^-(t) = z^+(t) - (z^+(t_m) - y^-(t_m))$.

• If $t_m > t^+$, then $z^-$ is the tangent to $y^-$ at $t^+$:

$$z^-(t) = y^-(t^+) + (y_1^- + 2 y_2^- \cdot t^+)(t - t^+). \quad (7)$$

• If $t_m < t^-$, then $z^-$ is the tangent to $y^-$ at $t^-$:

$$z^-(t) = y^-(t^-) + (y_1^- + 2 y_2^- \cdot t^-)(t - t^-). \quad (8)$$

6 Case 4: $y_2^- > 0$, $y_2^+ < 0$

Derivation of the solution. In this case, both functions $y^-(t)$ and $y^+(t)$ are "pointing outward".

As a first step of constructing the optimal linear approximation $\mathbf{z}(t)$, let us first make sure that we have
the smallest possible value of the worst-case width. For every $t \in [t^-, t^+]$, from $\mathbf{y}(t) \subseteq \mathbf{z}(t)$, we can conclude
that the width of $\mathbf{z}(t)$ is at least as large as the width of the interval $\mathbf{y}(t)$. Thus, the worst-case width $W(\mathbf{z})$
of the approximating linear function $\mathbf{z}(t)$ cannot be smaller than the worst-case width $W(\mathbf{y})$ of the original
(quadratic) interval function $\mathbf{y}$. Since we are minimizing $W(\mathbf{z})$, it is therefore desirable to choose $\mathbf{z}$ in such
a way that its worst-case width is exactly equal to $W(\mathbf{y})$.

The worst-case width $W(\mathbf{y})$ is a maximum of the quadratic width function $y^+(t) - y^-(t)$ on the interval
$[t^-, t^+]$. By differentiating this difference, one can easily get an explicit expression for this maximum point
$t_M$ (see below).

If this maximum point $t_M$ is inside the interval $[t^-, t^+]$, then at this point $t_M$, both approximating lines
$z^-(t)$ and $z^+(t)$ must be tangent to the corresponding functions $y^-(t)$ and $y^+(t)$, because otherwise, in at
least one of the directions, the width will increase.

If this maximum $t_M$ is attained at one of the endpoints, e.g., for $t^+$, then we must have $z^-(t^+) = y^-(t^+)$
and $z^+(t^+) = y^+(t^+)$. In this case, to guarantee the smallest possible best-case width, we must place $z^+(t)$
as low as possible (i.e., along the tangent to $y^+$), and $z^-(t)$ as high as possible, i.e., similarly, along the
tangent to $y^-(t)$.

As a result, we arrive at the following formulas: 

Solution. Compute $t_M = -(y_1^+ - y_1^-)/[2 (y_2^+ - y_2^-)]$. Then:

• If $t_M \in [t^-, t^+]$, then $z^-$ is the tangent to $y^-$ at $t_M$, and $z^+$ is the tangent to $y^+$ at $t_M$:
$z^-(t) = y^-(t_M) + (y_1^- + 2 y_2^- \cdot t_M)(t - t_M)$; $z^+(t) = y^+(t_M) + (y_1^+ + 2 y_2^+ \cdot t_M)(t - t_M)$.

• If $t_M > t^+$, then $z^-$ is the tangent (7) to $y^-$ at $t^+$, and $z^+$ is the tangent (5) to $y^+$ at $t^+$.

• If $t_M < t^-$, then $z^-$ is the tangent (8) to $y^-$ at $t^-$, and $z^+$ is the tangent (6) to $y^+$ at $t^-$.
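
All three branches of the Case 4 solution take tangents at a single point: $t_M$ itself when it lies inside $[t^-, t^+]$, and the nearer endpoint otherwise. A minimal sketch can therefore clamp $t_M$ to the interval and reuse one tangent routine. The coefficient-tuple representation and the names `tangent` and `case4` are assumptions made for illustration.

```python
from typing import Tuple

Linear = Tuple[float, float]             # (z0, z1) meaning z0 + z1 * t
Quadratic = Tuple[float, float, float]   # (y0, y1, y2) meaning y0 + y1*t + y2*t**2


def tangent(c: Quadratic, t_c: float) -> Linear:
    """Tangent line to y0 + y1*t + y2*t**2 at t_c, as (intercept, slope)."""
    c0, c1, c2 = c
    slope = c1 + 2.0 * c2 * t_c
    # y(t_c) - slope * t_c simplifies to c0 - c2 * t_c**2
    return c0 - c2 * t_c * t_c, slope


def case4(y_lo: Quadratic, y_hi: Quadratic,
          t_minus: float, t_plus: float) -> Tuple[Linear, Linear]:
    """Lines (z^-, z^+) for Case 4: y2^- > 0 (convex lower), y2^+ < 0 (concave upper)."""
    if not (y_lo[2] > 0 and y_hi[2] < 0):
        raise ValueError("Case 4 requires y2^- > 0 and y2^+ < 0")
    # point where the width y^+(t) - y^-(t) is largest on the whole real line
    t_M = -(y_hi[1] - y_lo[1]) / (2.0 * (y_hi[2] - y_lo[2]))
    # clamping reproduces the three branches: interior, t_M > t^+, t_M < t^-
    t_c = min(max(t_M, t_minus), t_plus)
    return tangent(y_lo, t_c), tangent(y_hi, t_c)


# Hypothetical input: y^-(t) = t^2, y^+(t) = 10 + 2t - t^2, so t_M = 0.5.
print(case4((0.0, 0.0, 1.0), (10.0, 2.0, -1.0), 0.0, 2.0))   # t_M inside [0, 2]
print(case4((0.0, 0.0, 1.0), (10.0, 2.0, -1.0), 1.0, 2.0))   # t_M < t^- = 1
```

In the first call the two tangents are parallel, so the approximating interval has constant width equal to $W(\mathbf{y}) = 10.5$.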